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In this paper, we consider the problem of sum rate maximization in a bidirectional relay network with fading. 

Hereby, user 1 and user 2 communicate with each other only through a relay, i.e., a direct link between user 1 and 

user 2 is not present. In this network, there exist six possible transmission modes: four point-to-point modes (user 

1-to-relay, user 2-to-relay, relay-to-user 1, relay-to-user 2), a multiple access mode (both users to the relay), and a 

broadcast mode (the relay to both users). Most existing protocols assume a fixed schedule of using a subset of the 

aforementioned transmission modes, as a result, the sum rate is limited by the capacity of the weakest link associated 

with the relay in each time slot. Motivated by this limitation, we develop a protocol which is not restricted to adhere 

to a predefined schedule for using the transmission modes. Therefore, all transmission modes of the bidirectional 

relay network can be used adaptively based on the instantaneous channel state information (CSI) of the involved 

links. To this end, the relay has to be equipped with two buffers for the storage of the information received from 

users 1 and 2, respectively. For the considered network, given a total average power budget for all nodes, we jointly 

optimize the transmission mode selection and power allocation based on the instantaneous CSI in each time slot for 



sum rate maximization. Simulation results show that the proposed protocol outperforms existing protocols for all 
signal-to-noise ratios (SNRs). Specifically, we obtain a considerable gain at low SNRs due to the adaptive power 
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allocation and at high SNRs due to the adaptive mode selection. 



H ' I. Introduction 

In a bidirectional relay network, two users exchange information via a relay node 0"). Several protocols have 
been proposed for such a network under the practical half-duplex constraint, i.e., a node cannot transmit and receive 
at the same time and in the same frequency band. The simplest protocol is the traditional two-way relaying protocol 
in which the transmission is accomplished in four successive point-to-point phases: user 1-to-relay, relay-to-user 2, 
user 2-to-relay, and relay-to-user 1. In contrast, the time division broadcast (TDBC) protocol exploits the broadcast 
capability of the wireless medium and combines the relay-to-user 1 and relay-to-user 2 phases into one phase, 
the broadcast phase [2|. Thereby, the relay broadcasts a superimposed codeword, carrying information for both 
user 1 and user 2, such that each user is able to recover its intended information by self-interference cancellation. 
Another existing protocol is the multiple access broadcast (MABC) protocol in which the user 1-to-relay and user 
2-to-relay phases are also combined into one phase, the multiple-access phase [3|. In the multiple-access phase, 
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both user 1 and user 2 simultaneously transmit to the relay which is able to decode both messages. Generally, 
for the bidirectional relay network without a direct link between user 1 and user 2, six transmission modes are 
possible: four point-to-point modes (user 1-to-relay, user 2-to-relay, relay-to-user 1, relay-to-user 2), a multiple 
access mode (both users to the relay), and a broadcast mode (the relay to both users), where the capacity region 
of each transmission mode is known J4), 0. Given this knowledge, there has been a great research interest in 
obtaining the achievable rate region and the capacity of the bidirectional relay network with and without fading 
|[T]- (8). The capacity of the bidirectional relay network without a direct link between user 1 and user 2 has been 
obtained for additive white Gaussian noise (AWGN) channels without fading, whereas when fading is present, only 
achievable rates are known. One of the reasons that previous protocols cannot achieve the capacity is that they have 
a fixed schedule of using the transmission modes in each time slot without taking into account the instantaneous 
channel state information (CSI) of the involved links. For one-way relaying, an adaptive link selection protocol was 
proposed in |9| where based on the instantaneous CSI, in each time slot, either the source-relay or relay-destination 
links are selected for transmission. To this end, the relay has to have a buffer for data storage. This strategy was 
shown to achieve the capacity of the one-way relay channel with fading [10|. 

Moreover, in fading AWGN channels, power control is necessary for rate maximization. The highest degree of 
freedom that is offered by power control is obtained for a joint average power constraint for all nodes. Any other 
power constraint with the same total power budget is more restrictive than the joint power constraint and results in 
a lower sum rate. Therefore, motivated by the protocols in [9| and [10|, our goal is to utilize all available degrees 
of freedom of the three-node half-duplex bidirectional relay network with fading, via an adaptive mode selection 
and power allocation policy. In particular, given a joint power budget for all nodes, we find a policy which in each 
time slot selects the optimal transmission mode from the six possible modes and allocates the optimal powers to 
the nodes transmitting in the selected mode, such that the sum rate is maximized. 

Adaptive mode selection for bidirectional relaying was also considered in [8| and ifTTl . However, the selection 
policy in JH] does not use all possible modes, i.e., it only selects from two point-to-point modes and the broadcast 
mode, and assumes that the transmit powers of all three nodes are fixed and identical. Although the selection policy 
in IfTTl considers all possible transmission modes for adaptive mode selection, the transmit powers of the nodes are 
assumed to be fixed, i.e., power allocation is not possible. Interestingly, mode selection and power allocation are 
mutually coupled and the modes selected with the protocol in [11] for a given channel are different from the modes 
selected with the proposed protocol. Power allocation can considerably improve the sum rate by optimally allocating 
the powers to the nodes based on the instantaneous CSI especially when the total power budget in the network 
is low. Moreover, the proposed protocol achieves the maximum sum rate in the considered bidirectional network. 
Hence, the sum rate achieved with the proposed protocol can be used as a reference for other low complexity 
suboptimal protocols. Simulation results confirm that the proposed protocol outperforms existing protocols. 

Finally, we note that the advantages of buffering come at the expense of an increased end-to-end delay. However, 
with some modifications to the optimal protocol, the average delay can be bounded, as shown in [9], which causes 
only a small loss in the achieved rate. The delay analysis of the proposed protocol is beyond the scope of the 
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Fig. 1. Three-node bidirectional relay network consisting of two users and a relay. 
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Fig. 2. The six possible transmission modes in the considered bidirectional relay network. 



current work and is left for future research. 



II. System Model 

In this section, we first describe the channel model. Then, we provide the achievable rates for the six possible 
transmission modes. 

A. Channel Model 

We consider a simple network in which user 1 and user 2 exchange information with the help of a relay node 
as shown in Fig. 1. We assume that there is no direct link between user 1 and user 2, and thus, user 1 and user 
2 communicate with each other only through the relay node. We assume that all three nodes in the network are 
half-duplex. Furthermore, we assume that time is divided into slots of equal length and that each node transmits 
codewords which span one time slot or a fraction of a time slot as will be explained later. We assume that the user- 
to-relay and relay-to-user channels are impaired by AWGN with unit variance and block fading, i.e., the channel 
coefficients are constant during one time slot and change from one time slot to the next. Moreover, in each time 
slot, the channel coefficients are assumed to be reciprocal such that the user 1-to-relay and the user 2-to-relay 
channels are identical to the relay-to-user 1 and relay-to-user 2 channels, respectively. Let hi(i) and fi2(i) denote 
the channel coefficients between user 1 and the relay and between user 2 and the relay in the i-th time slot, 
respectively. Furthermore, let Si(i) = |/ii(i)| 2 and S^M = |^2(*)| 2 denote the squares of the channel coefficient 
amplitudes in the i-th time slot. S\(i) and ^(i) are assumed to be ergodic and stationary random processes with 
means Q\ = E{Si} and Q2 — ^{^JlI respectively, where E{-} denotes expectation. Since the noise is AWGN, 
in order to achieve the capacity of each mode, nodes have to transmit Gaussian distributed codewords. Therefore, 
the transmitted codewords of user 1, user 2, and the relay are comprised of symbols which are Gaussian distributed 
random variables with variances P\{i), ^2(2)' an ^ Pr{i), respectively, where Pj(i) is the transmit power of node 

'in this paper, we drop time index % in expectations for notational simplicity. 
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j G {1, 2, r} in the i-th time slot. For ease of notation, we define C(x) = log 2 (l + x). In the following, we describe 
the transmission modes and their achievable rates. 

B. Transmission Modes and Their Achievable Rates 

In the considered bidirectional relay network only six transmission modes are possible, cf. Fig. [2] The six possible 
transmission modes are denoted by Mi, ..., A4q, and Rjj>(i) > 0, j,j' £ {l,2,r}, denotes the transmission rate 
from node j to node j' in the i-th time slot. Let B\ and B2 denote two infinite-size buffers at the relay in which 
the received information from user 1 and user 2 is stored, respectively. Moreover, Qj(i), j G {1 7 2}, denotes the 
amount of normalized information in bits/symbol available in buffer Bj in the i-th time slot. Using this notation, 
the transmission modes and their respective rates are presented in the following: 

Mi: User 1 transmits to the relay and user 2 is silent. In this mode, the maximum rate from user 1 to the relay in 
the i-th time slot is given by R\ r {i) = Ci r (i), where Ci r (i) ~ C(Pi(i)Si(i)). The relay decodes this information 
and stores it in buffer B±, Therefore, the amount of information in buffer B\ increases to Qi(i) = Qi(i— 1)+Ri r (i). 

M.-1- User 2 transmits to the relay and user 1 is silent. In this mode, the maximum rate from user 2 to the relay in 
the i-th time slot is given by Ri r (%) = C% r {i), where C% r {i) = C(P2(i)S2{i))- The relay decodes this information 
and stores it in buffer i?2- Therefore, the amount of information in buffer B 2 increases to Q<2,{%) — Q2{i~^)+R2r{i)- 

M3: Both users 1 and 2 transmit to the relay simultaneously. For this mode, we assume that multiple access 
transmission is used, see [5|. Thereby, the maximum achievable sum rate in the i-th time slot is given by R\ r (i) + 
Rir{i) — C r (i), where C r (i) — C(Pi(i)Si(i)+P2{i)S2(i))- Since user 1 and user 2 transmit independent messages, 
the sum rate, C r (i), can be decomposed into two rates, one from user 1 to the relay and the other one from user 
2 to the relay. Moreover, these two capacity rates can be achieved via time sharing and successive interference 
cancelation. Thereby, in the first < t(i) < 1 fraction of the i-th time slot, the relay first decodes the codeword 
received from user 2 and considers the signal from user 1 as noise. Then, the relay subtracts the signal received from 
user 2 from the received signal and decodes the codeword received from user 1. A similar procedure is performed 
in the remaining 1 — i(i) fraction of the i-th time slot but now the relay first decodes the codeword received from 
user 1 and treats the signal of user 2 as noise, and then decodes the codeword received from user 2. Therefore, for 
a given t(i), we decompose C r (i) as C r (i) — Ci2- r (i) + C , 2i r («) and the maximum rates from users 1 and 2 to the 
relay in the i-th time slot are i?i r (i) = Ci2 r (i) and R2r{i) = C2ir(*X respectively. Ci2 r (i) and C2i r (*) are given 
by 

Cr iar (i) = t(i)C(Pi(i)Si(t))+(l-*(»))cf T ^^^ > ) (la) 

\l + P2(l)S2{t)J 

c 21r (i) - (1 - t(i))c (P 2 (i)s 2 (i))+t(i)c ( -M^J ) (ib) 

The relay decodes the information received from user 1 and user 2 and stores it in its buffers B\ and B2, respectively. 
Therefore, the amounts of information in buffers B\ and B2 increase to Qi(i) = Qi(i — 1) + Ri r (i) and Q2W = 
Q2{i — 1) + R2r{i), respectively. 
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A44- The relay transmits the information received from user 2 to user 1. Specifically, the relay extracts the 
information from buffer B2, encodes it into a codeword, and transmits it to user 1. Therefore, the transmission rate 
from the relay to user 1 in the i-th time slot is limited by both the capacity of the relay-to-user 1 channel and 
the amount of information stored in buffer B^. Thus, the maximum transmission rate from the relay to user 1 is 
given by -R r i(i) = min{C r i(i), Qi(i — 1)}, where CVi(i) = C(P r (i)Si(i)). Therefore, the amount of information 
in buffer Bi decreases to Q2{i) = Q2{i — 1) — Rri{i)- 

Ai 5 : This mode is identical to M4 with user 1 and 2 switching places. The maximum transmission rate from 
the relay to user 2 is given by Rri{i) = min{C r 2(i), Qi(i — 1)}, where C r 2(i) = C(P r (i)S2{i)) and the amount 
of information in buffer B\ decreases to Qi(i) = Qi(i — 1) — R r 2{i)- 

M.q. The relay broadcasts to both user 1 and user 2 the information received from user 2 and user 1, respectively. 
Specifically, the relay extracts the information intended for user 2 from buffer B\ and the information intended for 
user 1 from buffer B2- Then, based on the scheme in |4|, it constructs a superimposed codeword which contains 
the information from both users and broadcasts it to both users. Thus, in the i-th time slot, the maximum rates from 
the relay to users 1 and 2 are given by R rl (i) = mm{C r i(i),Q2(i — 1)} and R r 2(i) = niin{C r 2(i), <2i(i — 1)}, 
respectively. Therefore, the amounts of information in buffers B x and B 2 decrease to Q\(i) — Qi{i — 1) — R r 2{i) 
and Q2(i) = Q2{i — 1) — Rri{i), respectively. 

Our aim is to develop an optimal mode selection and power allocation policy which in each time slot selects 
one of the six transmission modes, .Mi, ..., M.§, and allocates the optimal powers to the transmitting nodes of the 
selected mode such that the average sum rate of both users is maximized. To this end, we introduce six binary 
variables, qk{i) <2 {0, 1}, k = 1, ...,6, where qk{i) indicates whether or not transmission mode Mk is selected in 
the i-th time slot. In particular, qk{i) = 1 if mode M.^ is selected and qk{i) = if it is not selected in the i-th 
time slot. Furthermore, since in each time slot only one of the six transmission modes can be selected, only one 
of the mode selection variables is equal to one and the others are zero, i.e., X)fc=i Qk{i) = 1 holds. 

In the proposed framework, we assume that all nodes have full knowledge of the CSI of both links. Thus, based 
on the CSI and the proposed protocol, cf. Theorem each node is able to individually decide which transmission 
mode is selected and adapt its transmission strategy accordingly. 

III. Joint Mode Selection and Power Allocation 

In this section, we first investigate the achievable average sum rate of the network. Then, we formulate a 
maximization problem whose solution is the sum rate maximizing protocol. 

A. Achievable Average Sum Rate 

We assume that user 1 and user 2 always have enough information to send in all time slots and that the number 
of time slots, N, satisfies N — > 00. Therefore, using qk{i), the user 1-to-relay, user 2-to-relay, relay-to-user 1, and 
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relay-to-user 2 average transmission rates, denoted by R\ r , R 2r , R r \, and R r 2, respectively, are obtained as 

N 



R lr = lim — y^[qi(i)C lr (i) + q s (i)Ci2r(i)} (2a) 

TV— >ooiv *• — ' 
i=l 

1 N 

R 2r = lim — V [q 2 (i)C 2r {i) + « 3 (t)C 2 ir(t)] (2b) 

N — f oo IV z — * 

4=1 

1 w 

R r i= lim — y^ [g 4 (i)+g6(«)]min{Cr.i(i),(32(i - 1)} (2c) 

N— s-ooiV-' — ' 
i=l 

1 ^ 

^2= lim — V" [q 5 (i)+q 6 (i)]mm{C r2 (i),Qi(i- 1)}- (2d) 

i=l 

The average rate from user 1 to user 2 is the average rate that user 2 receives from the relay, i.e., R r2 . Similarly, the 
average rate from user 2 to user 1 is the average rate that user 1 receives from the relay, i.e., R r \. In the following 
theorem, we introduce a useful condition for the queues in the buffers of the relay leading to the optimal mode 
selection and power allocation policy. 

Theorem 1 (Optimal Queue Condition): The maximum average sum rate, R r i + R r2 , for the considered bidi- 
rectional relay network is obtained when the queues in the buffers B\ and B 2 at the relay are at the edge of 
non-absorbtion. More precisely, the following conditions must hold for the maximum sum rate 

1 N 
R lr =R r2 = Km —}[q5(i) + q 6 (i))C r 2(i) (3a) 

i=l 
1 N 

R 2r = R rl = lim — y [q 4 (i) + q 6 (i)} C rl (i) (3b) 

./V=>ooiV *• — ' 
i=l 

where _R lr and R 2r are given by (f2^) and (fJJ)), respectively. 

Proof Please refer to 0j] Appendix A]. 

Using this theorem, in the following, we derive the optimal transmission mode selection and power allocation policy. 

B. Optimal Protocol 

The available degrees of freedom in the considered network in each time slot are the mode selection variables, 
the transmit powers of the nodes, and the time sharing variable for multiple access. Herein, we formulate an 
optimization problem which gives the optimal values of qk{i), Pj(i)> an d t(i), for k = 1, ..., 6, j — 1, 2, r, and Vi, 
such that the average sum rate of the users is maximized. The optimization problem is as follows 

maximize R\ r + R 2r 
subjected to CI : R\ r = R r2 

02 : R 2 r = Rrl 

C3 : P 1 +P 2 + P r < P t 
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C4: E fc=1 3*(*) = 1 ' v * 
C5: ft (t)[l -«*(*)]= 0, Vt,* 

C6: Pj(i)>0, Vi,j 

C7 : < t{i) < 1, Vi (4) 

where P* is the total average power constraint of the nodes and Pi , Pa, and P r denote the average powers consumed 
by user 1, user 2, and the relay, respectively, and are given by 

1 N 
^ = JvE(9 1 W + 8s('))A(») (5a) 

4=1 
1 - 

^ = -jy !>»(») + 33(*))-*M«) (5b) 

i=l 



Pt = jj £(ff4(t) + <fc(t) + qe(i))Pr(i)- (5c) 

i=l 

In the optimization problem given in (0J, constraints CI and C2 are the conditions for sum rate maximization 
introduced in Theorem Q] Constraints C3 and C6 are the average total transmit power constraint and the power 
non-negativity constraint, respectively. Moreover, constraints C4 and C5 guarantee that only one of the transmission 
modes is selected in each time slot, and constraint C7 specifies the acceptable interval for the time sharing variable 
t(i). Furthermore, we maximize Ri r + R,2 r since, according to Theorem 1 (and constraints CI and C2), R± r = R r 2 
and B,2r — R r i hold. 

In the following Theorem, we introduce a protocol which achieves the maximum sum rate. 

Theorem 2 (Mode Selection and Power Allocation Policy): Assuming N — > oo, the optimal mode selection and 
power allocation policy which maximizes the sum rate of the considered three-node half-duplex bidirectional relay 
network with AWGN and block fading is given by 

1, i,r=a tgm a. { A t ( !)} 

,„. (i)={ k=l,2,»fi (6) 

0, otherwise 
where Afc(») is referred to as selection metric and is given by 

AiOO = (1 - m)Ci r {i) - jPi(i)\ Pl(i) = p M 1(i) (7a) 

A 2 (») = (1 - f i 2 )C 2 r(i) - 7^(*)U w =^» (i) ( 7b ) 

A 3 (i) = (1 - /il)Ci2r(i) + (1 - m)C 2 lr(i) 

- 7 (Pi« + P 2 «) 



P 2 (i)=P^(i) 



(7c) 



Ae(i) = li.\C r2 {i) + fJvC r i(i) - 1 p r{i)\ Pr{i)=p M 6 {i) (7d) 
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where P k {i) denotes the optimal transmit power of node j for transmission mode M k in the i-th time slot and 
is given by 



pr i (i) 



>M 



P^(i) 



7 ln2 Si(i) 

l-/i 2 i_ 

7ln2 ~ 



pM 3 , , , _ 



(0 



l~Mi 
7 In 2 



S 2 (i) 



i + 



S 2 («) 1 



3X3 

2 



Po 7W3 (i) 



J-Me 



(0 



7ln2 Si(i) j 



7ln2 1 _ s 2(') 



1-M2 _ 

7ln2 
6 + VF 



sTF) 



, if fti > ft 2 



, otherwise 



S 2 (i) 

1 



7ln2 1 %(') 



,if fii > ft 2 



-1 + 



, otherwise 



4ac 



2« 



(8a) 
(8b) 

(8c) 

(8d) 

(8e) 



where [x} + = max{x, 0}, a = 7 ln2 x Si(i)S 2 (i),b = 7 ln2 x (Si(i) + S 2 (i)) - (m + ^2)Si(i)S 2 (i), and c = 
7ln2 — fiiS2(i) — H2Si(i). The thresholds [i\ and /12 are chosen such that constraints CI and C2 in (0]i hold and 
threshold 7 is chosen such that the total average transmit power satisfies C3 in ©. The optimal value of t(i) in 
Ci2r{i) and C 2 i r (i) is given by 



t*(i) 




(9) 



Proof Please refer to Appendix [A] 

We note that the optimal solution utilizes neither modes .M4 and A4$ nor time sharing for any channel statistics 
and channel realizations. 

Remark 1: The mode selection metric Aj.(i) introduced in (0 has two parts. The first part is the instantaneous 
capacity of mode M k , and the second part is the allocated power with negative sign. The capacity and the power 
terms are linked via thresholds /ii and/or /12 and 7. We note that thresholds /xi, fi2, and 7 depend only on the long 
term statistics of the channels. Hence, these thresholds can be obtained offline and used as long as the channel 
statistics remain unchanged. To find the optimal values for the thresholds fix, fj,2, and 7, we need a three-dimensional 
search, where /Xi,/i2 € (0 1) and 7 > 0. 

Remark 2: Adaptive mode selection for bidirectional relay networks under the assumption that the powers of the 
nodes are fixed is considered in Bill , Based on the average and instantaneous qualities of the links, all of the six 
possible transmission modes are selected in the protocol in IfTD . However, in the proposed protocol, modes M.<± 
and M.% are not selected at all. Moreover, the protocol in ifTTl utilizes a coin flip for implementation. Therefore, 
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Fig. 3. Maximum sum rate versus P t for different protocols. 

a central node must decide which transmission mode is selected in the next time slot. However, in the proposed 
protocol, all nodes can find the optimal mode and powers based on the full CSI. 

IV. Simulation Results 

In this section, we evaluate the average sum rate achievable with the proposed protocol in the considered 
bidirectional relay network in Rayleigh fading. Thus, channel gains Si (i) and 52 (i) follow exponential distributions 
with means Hi and fi^, respectively. All of the presented results were obtained for £1% = 1 and N = 10 4 time 
slots. 

In Fig. [3j we illustrate the maximum achievable sum rate obtained with the proposed protocol as a function of 
the total average transmit power P t . In this figure, to have a better resolution for the sum rate at low and high P t , 
we show the sum rate for both log scale and linear scale y-axes, respectively. The lines without markers in Fig. [3] 
represent the achieved sum rates with the proposed protocol for f2i = 1, 2, 5. We observe that as the quality of the 
user 1-to-relay link increases (i.e., Oj increases), the sum rate increases too. However, for large fii, the bottleneck 
link is the relay-to-user 2 link, and since it is fixed, the sum rate saturates. 

As performance benchmarks, we consider in Fig. |3]the sum rates of the TDBC protocol with and without power 
allocation |2| and the buffer-aided protocols presented in JS) and ifTTI . respectively. For clarity, for the benchmark 
schemes, we only show the sum rates for ill = f^- For the TDBC protocol without power allocation and the 
protocol in [8|, all nodes transmit with equal powers, i.e., P\ = P2 = P r = Pt- For the buffer-aided protocol in 
IfTTI . we adopt P\ = P2 = P r = P and P is chosen such that the average total power consumed by all nodes is P t . 
We note that since ill = Q2 and Pi = P2, the protocol in [11] only selects modes A4 3 and A4q. Moreover, since 
Hi = ^2, we obtain /ii = {12 in the proposed protocol. Thus, considering the optimal power allocation in (HJ:) and 
([HJi), we obtain that either Pj 3 (i) or P^ii) is zero. Therefore, for the chosen parameters, only modes Mi^M.2, 



May 1, 2013 



DRAFT 



10 

and M§ are selected, i.e., the same modes as used in [8|j. Hence, we can see how much gain we obtain due to the 
adaptive power allocation by comparing our result with the results for the protocol in [8|. On the other hand, the 
gain due to the adaptive mode selection can be evaluated by comparing the sum rate of the proposed protocol with 
the result for the TDBC protocol with power allocation. From the comparison in Fig. [5] we observe that for high 
Pt, a considerable gain is obtained by the protocols with adaptive mode selection (ours and that in [8|) compared 
to the TDBC protocol which does not apply adaptive mode selection (around 6 dB gain). However, for high P t , 
power allocation is less beneficial, and therefore, the sum rates obtained with the proposed protocol and that in J8) 
converge. On the other hand, for low P t , optimal power allocation is crucial and, therefore, a considerable gain is 
achieved by the protocols with adaptive power allocation (ours and TDBC with power allocation). 

V. Conclusion 

We have derived the maximum sum rate of the three-node half-duplex bidirectional buffer-aided relay network with 
fading links. The protocol which achieves the maximum sum rate jointly optimizes the selection of the transmission 
mode and the transmit powers of the nodes. The proposed optimal mode selection and power allocation protocol 
requires the instantaneous CSI of the involved links in each time slot and their long-term statistics. Simulation 
results confirmed that the proposed selection policy outperforms existing protocols in terms of average sum rate. 

Appendix A 
Proof of Theorem 2 (Mode Selection Protocol) 

In this appendix, we solve the optimization problem given in |@). We first relax the binary condition for qk(i), 
i.e., qk(i)[l — Qk(i)] = 0, to < qk(i) < 1, and later in Appendix iBl we prove that the binary relaxation does not 
affect the maximum average sum rate. In the following, we investigate the Karush-Kuhn-Tucker (KKT) necessary 
conditions [12] for the relaxed optimization problem and show that the necessary conditions result in a unique sum 
rate and thus the solution is optimal. 

To simplify the usage of the KKT conditions, we formulate a minimization problem equivalent to the relaxed 
maximization problem in (0|i as follows 

minimize —{R\ r + R,2 r ) 

qk{i),t(i), Vi,fc 

subject to CI : R\ r — R T 2 = 

C2 : i?2r ~~ Rrl = 

C3 : Pi+P 2 + P r - Pt < 

C4: EL*w~ 1 = ' Vi 

C5 : Q fc (i) - 1 <0, Mi,k 

2 We note that although the protocol in |8| outperforms the protocol in [11] if the sum rate is plotted as a function of total transmit power as 
is done in Fig. fj] the protocol in 1111 is optimal for given fixed node transmit powers. 
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£(Qk(i), t(i), m, 1, A (*X &k(i), Pk(i), 4>i{i)) = 

for Va,fe,Z 

-(Rlr + R 2 r) + Hl(Rlr ~ Rr 2 ) + Hl{R 2 r - flrl) + J (Pi + P2 + Pr ~ P t ) 

N / 6 \ N 6 N 6 

1=1 \fc=l / 1=1 fe=l 4=1 fe=l 

JV AT at 

- ^ M»)iM*) + v 2 {i)P 2 {i) + v 3 (i)P 3 (i)] + J2 &(*)(*(*) - 1) - E &(»)*(*) (ID 



C6 : -gjfe(i) <0, Vi, fc 

C7: -P,-(i) <0, Vi,fc 

C8 : t(i) - 1 < 0, Vi 

C9 : -t(i) < 0, Vi (10) 

The Lagrangian function for the above optimization problem is provided in ( fTTT i at the top of the next page 
where fJ,i,(A 2 , 7, X(i), otk(i), 0k{i), Vj(i), <f>i(i), and (j) 2 {i) are the Lagrange multipliers corresponding to constraints 
CI, C2, C3, C4, C5, C6, C7, C8, and C9, respectively. The KKT conditions include the following: 
1) Stationary condition: The differentiation of the Lagrangian function with respect to the primal variables, q k (i) , Pj (i), 
and t(i), Vi,j,k, is zero for the optimal solution, i.e., 

dC 



dq k (i) 

dC 
dPjii) 

dC 



0, V«,fc (12a) 

0, Vi,j (12b) 

0, Vi. (12c) 



&t(i) 

2) Primal feasibility condition: The optimal solution has to satisfy the constraints of the primal problem in (Hot . 

3) Dual feasibility condition: The Lagrange multipliers for the inequality constraints have to be non-negative, i.e., 

a k (i)>0, Vz,/c (13a) 

/8fc(»)>0, Vz,fc (13b) 

7>0, (13c) 

vj(i)>0, Vi,j (13d) 

0i(O > 0, V*,I. (13e) 

4) Complementary slackness: If an inequality is inactive, i.e., the optimal solution is in the interior of the corre- 
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sponding set, the corresponding Lagrange multiplier is zero. Thus, we obtain 



"fc (i) (% (*) - 1) =0, Vi,k 
Pk (i) qk (i) = 0, Vi, k 

7(A + A + Pr - P) = 

i/ J -(t)P i (i) = 1 Vi,j 

^i(*)(t(»)-l)=0, V^ 

fo(*)t(t) = 0, Vi. 



(14a) 

(14b) 
(14c) 
(14d) 
(14e) 
(14f) 



A common approach to find a set of primal variables, i.e., qk(i),Pj(i),t(i), \/i,j,k and Lagrange multipliers, 
i.e., /ii,/X2,7, A(i),afe(z),/8fe(i),fj(i),^i((i), Vi,k,l, which satisfy the KKT conditions is to start with the com- 
plementary slackness conditions and see if the inequalities are active or not. Combining these results with the 
primal feasibility and dual feasibility conditions, we obtain various possibilities. Then, from these possibilities, we 
obtain one or more candidate solutions from the stationary conditions and the optimal solution is surely one of 
these candidates. In the following subsections, with this approach, we find the optimal values of g|(j),P*(i), and 
**(*), Vi,j, k. 

A. Optimal ql(i) 

In order to determine the optimal selection policy, q%{i), we must calculate the derivatives in (fT2h). This leads 

to 



' " = --(l-^)C lr (z)+\(t)+ ai (t)-^(z) + - 7 Pi(i) = 



dqiii 
dC 



dq 2 (i 

dC 



dq 3 (i 
dC 



dqi{i 

dC 



dq 5 (i 

dC 



dq e {i 



= _l(i_ /i2 )C 2r (t)+A(«)+a 2 («)-j0a(t) + ^7P 2 «=0 

= -^[(l-Mi)C 12p (t)+(l-/* 2 )C 2 i r (i)]+A(t) + a 3 (i)- j 9 3 (»)+^7(Pi(*)+ft(»))=0 

= -^V2C r i(i)+\(i)+a i (i)-(3 i (i) + — 1 P r (i)=0 
= -—ViC r2 {i)+\(i)+a 5 (i)-p 5 (i) + —-{P r (i)=0 
= - — [lnC r2 (i)+Li2C rl (i)]+\{i) + a 6 (i)-p G {i) + — 1 P r {i)=Q. 



(15a) 
(15b) 
(15c) 
(15d) 
(15e) 
(15f) 



Without loss of generality, we first obtain the necessary condition for q*(i) = 1 and then generalize the result 
to ql(i) = 1, k = 2, . . . , 6. If q%(i) = 1, from constraint C4 in (fTOt . the other selection variables are zero, i.e., 
ql(i) =0, k = 2, ...,6. Furthermore, from dl4l . we obtain afe(i) =0, k = 2, ...,6 and /3i(i) = 0. By substituting 
these values into ( fTBI l, we obtain 



A(») + ai(t) - (1 - in)C lr (i) - jPi(i) = Ai(t) 



(16a) 
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A(t)-ft(t 
A(t) - ft (* 



= (l-/*2)C 2p (*)-7ft(i)4A 2 (t) 

= (l-Ml)Cl2rW + (l-M2)C 2 lrW - l(Pl W + ^W) = A 3 (i) 

= H2C r i(i) - jP r (i) = A 4 (i) 
= HiC r2 (i) - jP r (i) = A 5 («) 
= ViC r2 (i) + n 2 C r i(i) - jP r (i) = A 6 (t), 



(16b) 
(16c) 
(16d) 
(16e) 
(16f) 



where Afc(i) is referred to as selection metric. By subtracting dT6fr ) from the rest of the equations in ([TBI , we obtain 



Ai(») - A fc (*) = ai(i) + &(*), ft = 2, 3,4, 5,6. 



(17) 



From the dual feasibility conditions given in (fT3h ) and (fT3b). we have afc(j), 0k(i) > 0. By inserting afe(i), /3fc(i) > 
in (fTTT i. we obtain the necessary condition for ql(i) — 1 as 



Ai(i) > max {A 2 (i), A 3 (i), A 4 (i), A 5 (i), A 6 (i)} . 



(18) 



Repeating the same procedure for q%{i) = 1, fc = 2, . . . , 6, we obtain a necessary condition for selecting transmis- 
sion mode A4k* in the i-th time slot as follows 



Afc.(i)> max {Afc(i)}, 
fce{i,---,6} 



(19) 



where the Lagrange multipliers /ii,/i 2 , and 7 are chosen such that C1,C2, and C3 in ( TTOb hold and the optimal 
value of t(i) in Ci 2r (i) and C 2 i r (i) is obtained in the next subsection. We note that if the selection metrics are not 
equal in the i-th time slot, only one of the modes satisfies jl% . Therefore, the necessary conditions for the mode 
selection in (fT9l is sufficient. Moreover, in Appendix [B] we prove that the probability that two selection metrics 
are equal is zero due to the randomness of the time- continuous channel gains. Therefore, the necessary condition 
for selecting transmission mode A4k in ( fT9] l is in fact sufficient and is the optimal selection policy. 

B. Optimal P*(i) 

In order to determine the optimal Pj(i), we have to calculate the derivatives in (TlZb). This leads to 



dC 

dPiii) 



1 



Mn2 
Si(i 



{(1 - m)qi(i) - t(i)(pi - p 2 )q 3 [i)} 



l+A(t)Si(i 

Si CO 



- + {t(i)(fj, 1 -fi 2 ) + l-p 1 }q 3 (i) 



l + P 1 (i)S 1 (i) + P 2 (i)S 2 (i) 



+ 7jr(qi(i) + q 3 (i)) - vx{i) = 



(20a) 



_d£_ 



1 



Mn2 



{(i-A»2)«a(*)+(l-t(*))(A*i-M2)«a(t)} 



l+P 2 (i)S 2 (i 



- + {t(i)(fj, 1 -fi 2 ) + l-p 1 }q 3 (i) 
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i + p 1 (i)s 1 (i) + p 2 (i)s 2 (i) 

+ 7jy (©(*) + <?3(«)) - ^(i) = 



(20b) 



_ac_ 
ap r (o 



1 



/M<?4(«) + %(«)) 



S x (t) 



l + P r (t)Si(*) 



Mn2 
+ Mi (95 (») +96(0) 
+ 7^(<?4 00 + 95(«) + qe(i)) - v r (i) = 



l + P r (i)S 2 (i) 



(20c) 



The above conditions allow the derivation of the optimal powers for each transmission mode in each time slot. For 
instance, in order to determine the transmit power of user 1 in transmission mode Mi, we assume q*(i) = 1. From 
constraint C4 in dTOb . we obtained that the other selection variables are zero and therefore q^(i) = 0. Moreover, if 
.Mi is selected then Pi(i) 7^ and thus from (fT4h). we obtain v±(i) — 0. Substituting these results in d20h). we 
obtain 

P^(i) = 



1 -fix 



1 



(21) 



7ln2 S 1 (i)_ 

where [x] + — max{0,x}. In a similar manner, we obtain the optimal powers for user 2 in mode M. 2 , and the 
optimal powers of the relay in modes A4 4 and M5 as follows: 



pr a « 



1 



M2 



p™® 



7ln2 

/'2 



S 2 (i) 



1 



pr*(i) = 



7ln2 Si(i) 
Mi 1 



(22a) 
(22b) 
(22c) 



_ 7 ln2 Sa(i) 

In order to obtain the optimal powers of user 1 and user 2 in mode .M3, we assume q^(i) = 1. From C4 in ( [Tot , 
we obtain that the other selection variables are zero, and therefore q\(i) — and q 2 (i) = 0. We note that if one 
of the powers of user 2 and user 1 is zero mode M3 is identical to modes .Mi and Ai 2 , respectively, and for 
that case the optimal powers are already given by (|2H and (l22h ). respectively. For the case when P*(i) 7^ and 
P 2 *(i) 7^ 0, we obtain vl(i) = and ff (i) = from (TT4tl). Furthermore, for q$ (i) = 1, we will show in Appendix 
lAlC that t(i) can only take the boundary values, i.e., zero or one, and cannot be in between. Hence, if we assume 
t(i) = 0, from (120a) and d20"b). we obtain 

1-Mi Si(i) 



ln2 1 + Pi(i)Si(i) + P 2 (i)S 2 (i) 



1 
ln2 



(Mi " M2) 
+ 11 -Mi) 



&(*) 



1 + P 2 (i)5 2 (i) 
Sa(*) 



l + Pi(»)5i(t) + P2(t)S 2 (0 



7 = 



7 = 



(23a) 



(23b) 
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Sj(«) A 




A 3 (j) > Ai(«) 
s(0 > Aa(i) . 



Aa(i) > Ai(i) 
A 3 (i) > A 2 (i) 



Ar«-^:^>->A2(i) 




SaW A 



Ae(») ^ A 4 (i) 

I 

AeM I As(i) 



Si(i) 



— ► 
Si(i) 



a) Hi > \ii 



b) /i! < /i2 



Fig. 4. Comparison of the selection metrics in the space of (Si, S2)' a) Comparison of Ai(i), A2(i), and A3(i) if fii > fj,2, b) comparison 
of Ai(i), A2(i), and Aa(i) if fii < ^2, c) comparison of A4(i), A5(i), and Ag(i). 



By substituting d23h ) in d23b). we obtain P^ (i) and then we can derive Pj 3 (z) from (F23h). This leads to 



pr°(i) 




PTii) 



PT'W, 



M1-M2 



7 ln2 1 gjW S 2 (i) 

S 1 (i) 



, otherwise 



if S 2 > ^-Si 
+ 
, otherwise 



(24a) 



(24b) 



Similarly, if we assume t(i) — 1, we obtain 

p^(i), 



pr 3 (i) 



Ml— M2 



7 ln2 |lW_i Si(i) 



if 5 2 < r^Si 

z — 1— /X2 - 1 - 

+ 

, otherwise 



iT a (0, 



if S 2 > — £± 



pr°(i) = 



I'l ~)'2 o, 1 1 
-yln2 i l + i 



1 — ^2 M1~M2 1 

71n2 



7ln2 1 s 2(') 
Sl(i) 



(25a) 



(25b) 



, otherwise 



We note that when P± 3 (i) = P\ (i), we obtain P^ 3 (i) = which means that mode M.3 is identical to mode 
Mi. Thus, there is no difference between both modes so we select .Mi. In Figs [4] a) and[4]b), the comparison of 
Ai(i), Aa(i), and A^(i) is illustrated in the space of (Si(i), 82(1))- Moreover, the shaded area represents the region 
in which the powers of users 1 and 2 are zero for A4i,A / l2, and .M3. 

For mode A4q, we assume q§{£) = 1. From constraint C4 in dTOb . we obtain that the other selection variables 
are zero and therefore q%(i) — and q$(i) — 0. Moreover, if q%(i) — 1 then P*(i) 7^ and thus from (fl4ll). we 
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obtain v*(i) = 0. Using these results in J20b), we obtain 

^ ^ + p 1 ^l(^ + ^ T-TWrkn = ^ ln2 (26) 

1 + P r (z)5i(«) 1 + P r (t)5 2 («) 

The above equation is a quadratic equation and has two solutions for P r (i). However, since we have P r (i) > 0, we 
can conclude that the left hand side of (|26l> is monotonically decreasing in P r (i). Thus, if H 2 Si(i)+fiiS 2 (i) > 7hi2, 
we have a unique positive solution for P r (i) which is the maximum of the two roots of d2o*l l. Thus, we obtain 

-i + 



P r A/le (i) = 



-b + \/b 2 - Aac 
2a~ 



(27) 



where a = 7 ln2Si(i)S 2 (i)> b = 7 ln2(Si(i) + S 2 (i)) - (jii + ^)Si(i)S 2 (i), and c = 7 ln2 - mS 2 (i) - ^Si(i). 

In Fig. He), the comparison between selection metrics A^i) , A$(i) , and Ag(i) is illustrated in the space of 
(Si(i), S 2 (i)). We note that Ag(i) > A^i) and Ae(i) > As(i) hold and the inequalities hold with equality if 
5k(i) = and S\(i) = 0, respectively, which happen with zero probability for time -continuous fading. To prove 
Ae(i) > A^i), from ( fT6] l, we obtain 

A 6 (j) = mC r2 {i) + H2C r i(i) - 7P r {i)\ Pr{l)=p M 6(i) 
(«) 

> filC r2 (i) + H 2 C rl (l) - lPr{l)\ Pr(i)=p M l(e) 

(b) 

> H2C r l(i) ~ 7-Pr(«)|p r (i) = p^ 1 4 (i) = A 4 (i), (28) 

where (a) follows from the fact that P^ v<6 (i) maximizes Aq({) and (6) follows from /iiC r2 (i) > 0. The two 
inequalities (a) and (6) hold with equality only if S 2 (i) — which happens with zero probability in time-continuous 
fading or if /ii = 0. However, in Appendix O \i\ = is shown to lead to a contradiction. Therefore, the optimal 
policy does not select A^4 and M.5 and selects only modes .Mi, -^2,-^3, and ftA§. 

C. Optimal t*(i) 

To find the optimal t(i), we assume q^(i) = 1 and calculate the stationary condition in (fT2b). This leads to 

dC 1 

rr =-- ^(^1 - ^2) [C r (i) - Cir(i) - C 2r (i)} 



dt(i) N 

+Mi)-M*) = o ( 29 ) 

Now, we investigate the following possible cases for t*(i): 

Case 1: If < t*(i) < 1 then from (Q3) and (QjJ), we have </>t(i) = 0, I = 0,1. Therefore, from <|29) and 

C r (i) — Ci r (i) — C 2r {i) < 0, we obtain /j,i = \x 2 . Then, from d20k) and d20b). we obtain 

"E^ 1 - Ml) l+Pi^SiW+^WSaW +7 = 

.J_d_ ml M£> l~ = n 

I112V 1 WJ l+P t (»)Si(»)+P2(i)S 3 (0 +7 - u 
In Appendix ICl we show that /Ui 7^ 1, therefore, the above conditions can be satisfied simultaneously only if Si(i) = 
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52 (i), which, considering the randomness of the time-continuous channel gains, occurs with zero probability. Hence, 

the optimal t(i) takes the boundary values, i.e., zero or one, and not values in between. 

Case 2: If t*(i) = 0, then from (TBe). we obtain <j>\{i) = and from (fT3e). we obtain 4>o(i) > 0. Combining these 

results into (l29l i. the necessary condition for t(i) = is obtained as /zi > fi2- 

Case 3: If t*(i) — 1, then from (fT4F). we obtain </><j(i) = and from (TJJe). we obtain 0i(i) > 0. Combining these 

results into (1291 , the necessary condition for t(i) = 1 is obtained as /ii < /i2- 

We note that if fj,i = ^ 2 , we obtain either P 1 jV ' 3 (i) = or P 2 jV,3 (i) = 0, Vz. Therefore, mode M3 is not selected 
and the value of t(i) does not affect the sum rate. Moreover, from the selection metrics in ( fToT l. we can conclude 
that /ii > H2 and /ii < /i2 correspond to Hi > f^ and Hi < 0,2, respectively. Therefore, the optimal value of t(i) 
is given by 

(o, n,>n 2 

t*(i)={ (31) 

Now, the optimal values of qk(i),Pj(i), and t(i), Vi,j,k are derived based on which Theorem [2] can be 
constructed. This completes the proof. 

Appendix B 
Proof of Optimality of Binary Relaxation 

In this appendix, we prove that the optimal solution of the problem with the relaxed constraint, < qk{i) < 1, 
selects the boundary values of qk{i), i.e., zero or one. Therefore, the binary relaxation does not change the solution 
of the problem. If one of the qk(i), k = 1, . . . , 6, adopts a non-binary value in the optimal solution, then in order 
to satisfy constraint C4 in ®, there has to be at least one other non-binary selection variable in that time slot. 
Assuming that the mode indices of the non-binary selection variables are k' and k" in the i-th time slot, we obtain 
afc(«) =0, k = 1, . . . , 6 from (fl4k). and /3k'(i) = and /3k"(i) = from (fl4b). Then, by substituting these values 
into (1151 1. we obtain 

A(i)=Afc,(») (32a) 

A(t)=Afc//(i) (32b) 

A(i)-j0fc(t) = A fc (t), k^k',k". (32c) 

From d32h) and (l32b). we obtain A&/ (i) — Ak" (i) and by subtracting (l32h) and d32b) from (l32t), we obtain 

\ k ,(i)-A k (i)=p k (i), k^k',k" (33a) 

A fc „(t)-A fc (*) =&(*), k^k',k". (33b) 

From the dual feasibility condition given in (fT3b). we have (3 k (i) > which leads to Afc/(i) = Afe//(i) > Afc(i). 
However, as a result of the randomness of the time-continuous channel gains, Pr{A/->(i) = A k //(i)} > holds 
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for some transmission modes Mk' and Mk", if and only if we obtain Hi = 0, 1 or [ii = 0, 1 which leads to a 
contradiction as shown in Appendix [C] This completes the proof. 

Appendix C 
Threshold Regions 

In this appendix, we find the intervals which contain the optimal value of Hi and /i 2 . We note that for different 
values of [i\ and \l 2 , some of the optimal powers derived in (Bil l. (l22t . (1241 . (l25l l. and (f27b are zero for all channel 
realizations. For example, if fX\ = 1, we obtain P± l (i) = 0, Vi from (Bil l. Fig. [5] illustrates the set of modes that 
can take positive powers with non-zero probability in the space of (/Zi,/i2). In the following, we show that any 
values of \x\ and fi 2 except < fii < 1 and < /j, 2 < 1 cannot lead to the optimal sum rate or violate constraints 
CI or C2 in ([T0|. 

Case 1: Sets {Mi,M 2 ,M3} and {Ma,M§,M§} lead to selection of either the transmission from the users to 
the relay or the transmission from the relay to the users, respectively, for all time slots. This leads to violation of 
constraints CI and C2 in ( TTOb and thus the optimal values of \i\ and \x% are not in this region. 
Case 2: In set {Mi, Ma, Me}, both modes Ma and Me need the transmission from user 2 to the relay which can 
not be realized in this set. Thus, this set leads to violation of constraint C2 in (HOI , Similarly, in set {M 2 , M5, .Me}, 
both modes M5 and Me require the transmission from user 1 to the relay which can not be selected in this set. 
Thus, this region of Hi and ^2 leads to violation of constraint CI in ( TTOb . 

Case 3: In set {Mi, Ma, M5, Me}, there is no transmission from user 2 to the relay. Therefore, the optimal values 
of hi and /12 have to guarantee that modes .M4 and A4q are not selected for any channel realization. However, 
from (I161 l. we obtain 

Ae(i) = HiC r2 (i) +iu, 2 C r i(i) ~ lPr{i)\ Pr ^ =p M 6{i) 



(a) 

> MiC r2 (i) +M2C r i(i) -7P r (z)| 



Pr(i)=P -5 W 



> fiiC r2 (i) - jPr(i)\p r ( i)=p Ms {i) = A 5 (i), (34) 

where (a) follows from the fact that P^ v<6 (i) maximizes Ag(i) and (6) follows from HiC r i(i) > 0. The two 
inequalities (a) and (6) hold with equality only if Si(i) = which happens with zero probability for time- 
continuous fading, or \x 2 = which is not included in this region. Therefore, mode M.§ is selected in this region 
which leads to violation of constraint C2 in (TTOb . A similar statement is true for set ^2,^4,^5,^6}. Thus, 
the optimal values of [i\ and fj, 2 cannot be in these two regions. 
Case 4: In set {Mi, M 2 , M3, Ma, Me}, we obtain 

A 6 (j) = HlCr2{i) + H2Crl(i) ~ jPr(i)\ Pr{l)=p M 6{i) 

(«) 

< H2C r i(l) - jPr(l)\ pAi)=p M6 {i) 

(b) 

< H 2 C r l{l) - 7- P r(«j| Fr (i)=p^ ( 4 (i ) = A 4 («) (35) 
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P-i Ml = 1 



{Mi,Mi,Ms} 



{Mi,M 2 ,M 3 , 
Mi,Me} 



{Mi, Mz, M 9 } 



{Mi,Mt, 

M S ,M 6 } 



{Mi, Ma, 
Ms,Mi, 
M s ,M e } 



{Mi, Mi, 
Ms,M 6 , 

M 6 } 



{Mi,M 5 ,M 6 } 



-A*2 = 1 



{Ms, Mm 

M 5 ,M>} 



-»• Mi 



{Mt,M 5 ,M e } 



Fig. 5. Modes with non-negative powers in the space of (/ii,^)- 

where inequality (a) comes from the fact that /J.iC r2 (i) < and the equality holds when S%(i) = which happens 
with zero probability, or \x\ = 0. Inequality (6) holds since P^ /[i (i) maximizes Ai(i) and holds with equality only 
if P^ v,4 (i) = P/ /,6 (z) and consequently ix\ — 0. If /ii ^ 0, mode M.§ is not selected and there is no transmission 
from the relay to user 2. Therefore, the optimal values of n\ and [12 have to guarantee that modes VVfi and VW3 are 
not selected for any channel realization. Thus, we obtain /xi = 1 which is not contained in this region. If /ii = 0, 
from (TToT l. we obtain 

A 6 (i) =A 4 (i) = M 2 C r i(i) -7^ , r(i)| Pr(i)=P? Ai4 ( . l) 



(a) 

(6) 

< c lr (z)- 7 p 1 (i)! 



p r ( l )=p^ i4 (i) 



ft (i) = P«i W 



Ai(») 



(36) 



where both inequalities (a) and (b) hold with equality only if /i 2 = 1. If \Xi 7^ 1> modes .M4 and .Me are not 
selected. Thus, there is no transmission from the relay to the users which leads to violation of CI and C2 in ( fj~0b - 
If fi 2 = 1, we obtain P^ 2 ^) = 0, thus mode .M 2 cannot be selected and either p/^ 3 ^) = or P 2 jV,3 (i) = 0, 
thus mode M.% cannot be selected either. Since both modes .M4 and A4$ require the transmission from user 2 to 
the relay, and both modes .M 2 and .M3 are not selected, constraint C2 in ( TTOl ) is violated and /ii = and /i 2 = 1 
cannot be optimal. A similar statement is true for set {.Mi, M.%, At 3, M.5, M.&}- Therefore, the optimal values of 
fix and /i 2 are not in this region. 

Hence, set {Mi,A<2,-W3i .M4, -Ms, .Me} contains the optimal values of fi% and /i 2 , i.e., < Hi < 1 and 
< /i 2 < 1. This completes the proof. 
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